
Conversation


@inikep inikep commented Dec 3, 2025

Issue: rows_in_buffer could exceed the number of rows requested by a SELECT ... LIMIT query. This could lead to unnecessary memory usage and inefficient fetching.

Fix: Adjust rows_in_buffer to respect the SELECT query's LIMIT. Now, rows_in_buffer is capped at the query limit, aligning buffer allocation with the expected number of rows.

…r` to query `LIMIT`

```cpp
  ulonglong select_limit =
      std::max<ulonglong>(q_block->select_limit->val_uint(), 2);
  rows_in_buffer = std::min(rows_in_buffer, select_limit);
}
```

@dinodork dinodork commented Dec 4, 2025


The solution works, and I notice a speedup of about a factor of three. I don't see any serious problems with it. It's not ideal that the executor peeks into the AST, however. My take on the coding pattern is that this type of information should be supplied by the iterator. This is probably something they would pick on if we proposed the patch upstream.

Before the regression, set_record_buffer() would cap the number of rows in the buffer at max_rows. So a more conservative approach would be to reintroduce that cap instead of peeking at the LIMIT, I guess?

What do you think?

